Stack Overflow Query Outcome Prediction
Authors
Abstract
Stack Overflow’s core mission is to create an online encyclopedia of all programming knowledge. To maintain content quality in the face of rapid growth, community moderators frequently close low-quality questions, which are often asked by newcomers. To ease the moderators’ burden and the newcomers’ transition, we devise two classifiers that predict 1) whether a question will be closed and, if so, 2) the reason for its closure. We train models using logistic regression, SVMs, and boosting, then select the best-performing classifier for each task. We find that adaptive boosting (AdaBoost) best predicts whether a question will be closed, whereas lasso-regularized logistic regression best predicts the reason for closure. Our next steps for improving the classifiers include using word vectors, splitting the data by time period, and extracting more features from code segments.
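As an illustration of the lasso-regularized logistic regression named in the abstract, the following is a minimal sketch and not the authors' implementation. It fits the model by proximal gradient descent on a hypothetical toy dataset; the function names, the two made-up features, the penalty strength, and the step size are all assumptions chosen for the example. The first feature separates the classes strongly, while the second carries only a small-magnitude signal, so the L1 penalty is expected to shrink its weight to exactly zero.

```python
import math


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def soft_threshold(value, threshold):
    """Proximal operator of the L1 penalty: shrink value toward zero."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def score(weights, bias, row):
    """Linear score (log-odds) of one example."""
    return bias + sum(w * x for w, x in zip(weights, row))


def fit_lasso_logistic(X, y, l1_penalty=0.1, step=0.5, iterations=500):
    """Proximal gradient descent for L1-penalized logistic regression."""
    n, d = len(X), len(X[0])
    weights, bias = [0.0] * d, 0.0
    for _ in range(iterations):
        grad_w, grad_b = [0.0] * d, 0.0
        for row, label in zip(X, y):
            error = sigmoid(score(weights, bias, row)) - label
            grad_b += error / n
            for j in range(d):
                grad_w[j] += error * row[j] / n
        # The bias is not penalized; the weights take a gradient step
        # followed by soft-thresholding, which produces exact zeros.
        bias -= step * grad_b
        weights = [soft_threshold(w - step * g, step * l1_penalty)
                   for w, g in zip(weights, grad_w)]
    return weights, bias


def predict(weights, bias, row):
    """Predict 1 (e.g. "closed") when the log-odds are non-negative."""
    return 1 if score(weights, bias, row) >= 0 else 0


# Hypothetical toy data: [strong feature, weak small-scale feature].
X = [[2.0, 0.1], [1.5, 0.1], [1.0, 0.1], [0.5, 0.1],
     [-2.0, -0.1], [-1.5, -0.1], [-1.0, -0.1], [-0.5, -0.1]]
y = [1, 1, 1, 1, 0, 0, 0, 0]

weights, bias = fit_lasso_logistic(X, y)
predictions = [predict(weights, bias, row) for row in X]
```

The soft-thresholding step is what distinguishes the lasso from ridge-style penalties: it sets weak weights to exactly zero, which acts as built-in feature selection when many candidate features are extracted from question text.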
Similar resources
StaQC: A Systematically Mined Question-Code Dataset from Stack Overflow
Stack Overflow (SO) has been a great source of natural language questions and their code solutions (i.e., question-code pairs), which are critical for many tasks including code retrieval and annotation. In most existing research, question-code pairs were collected heuristically and tend to have low quality. In this paper, we investigate a new problem of systematically mining question-code pairs...
Improving Stack Overflow Tag Prediction Using Eye Tracking
I) Goals and Purpose Software developers use Stack Overflow to post questions and answers related to programming and computer science problems they need to solve. Questions such as seeking input on some efficient and time-saving methods of coding a particular program, getting help on solving various bottlenecks in coding are commonly seen. When users submit questions on Stack Overflow they need...
Embedded Emotion-based Classification of Stack Overflow Questions Towards the Question Quality Prediction
Software developers often ask questions in Stack Overflow Q & A site, and their posted questions sometimes do not meet the standard guidelines. As a consequence, some of the questions are edited by expert users, some of them are down-voted, or some are even deleted permanently. Besides, the users (i.e., developers) might not get the expected solutions for their problems. In this paper, we study...
Advantages Of Object Relational Database Model
This article explores the differences between relational databases (RDBMS) and You should have looked into the property-graph model and optionally read especially for join heavy queries, the minutes to milliseconds advantage that Even object-relational mappers use SQL under the hood to talk to the database. When you write applications that communicate with a relational database, your created a ...
GitHub and Stack Overflow: Analyzing Developer Interests Across Multiple Social Collaborative Platforms
Increasingly, software developers are using a wide array of social collaborative platforms for software development and learning. In this work, we examined the similarities in developer’s interests within and across GitHub and Stack Overflow. Our study finds that developers share common interests in GitHub and Stack Overflow; on average, 39% of the GitHub repositories and Stack Overflow questio...
Publication date: 2016